Search CORE

139 research outputs found

Pairwise covariance adds little to secondary structure prediction but improves the prediction of non-canonical local structure

Author: Bobbie-Jo Webb-Robertson
C Bystroff
C Bystroff
C Bystroff
C Bystroff
C Bystroff
Christopher Bystroff
JA Hanley
KF Han
KT Simons
M Vingron
P Fariselli
Q Yi
U Gobel
U Hobohm
Y Fujitsuka
Y Zhang
Publication venue: BioMed Central
Publication date: 01/10/2008
Field of study

Abstract Background Amino acid sequence probability distributions, or profiles, have been used successfully to predict secondary structure and local structure in proteins. Profile models assume the statistical independence of each position in the sequence, but the energetics of protein folding is better captured in a scoring function that is based on pairwise interactions, like a force field. Results I-sites motifs are short sequence/structure motifs that populate the protein structure database due to energy-driven convergent evolution. Here we show that a pairwise covariant sequence model does not predict alpha helix or beta strand significantly better overall than a profile-based model, but it does improve the prediction of certain loop motifs. The finding is best explained by considering secondary structure profiles as multivariant, all-or-none models, which subsume covariant models. Pairwise covariance is nonetheless present and energetically rational. Examples of negative design are present, where the covariances disfavor non-native structures. Conclusion Measured pairwise covariances are shown to be statistically robust in cross-validation tests, as long as the amino acid alphabet is reduced to nine classes. An updated I-sites local structure motif library that provides sequence covariance information for all types of local structure in globular proteins and a web server for local structure prediction are available at <url>http://www.bioinfo.rpi.edu/bystrc/hmmstr/server.php</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Mining residue contacts in proteins using local structure predictions

Author: C. Bystroff
M.J. Zaki
Shan Jin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Protein local 3D structure prediction by Super Granule Support Vector Machines (Super GSVM)

Author: B Chen
B Chen
B Chen
B Zagrovic
Bernard Chen
C Bystroff
C Bystroff
C Cortes
C Sander
CC Chang
G Karp
G Wang
KF Han
KF Han
Matthew Johnson
R Kolodny
TY Lin
W Zhong
W Zhong
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Calibur: a tool for clustering large numbers of protein decoys

Author: C Bystroff
D Shortle
H Li
KS Arun
KT Simons
MR Betancourt
S Boris
S Wu
SC Li
Shuai Cheng Li
T Hamelryck
Y Zhang
Yen Kaow Ng
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Ab initio protein structure prediction methods generate numerous structural candidates, which are referred to as decoys. The decoy with the most number of neighbors of up to a threshold distance is typically identified as the most representative decoy. However, the clustering of decoys needed for this criterion involves computations with runtimes that are at best quadratic in the number of decoys. As a result currently there is no tool that is designed to exactly cluster very large numbers of decoys, thus creating a bottleneck in the analysis. Results Using three strategies aimed at enhancing performance (proximate decoys organization, preliminary screening via lower and upper bounds, outliers filtering) we designed and implemented a software tool for clustering decoys called Calibur. We show empirical results indicating the effectiveness of each of the strategies employed. The strategies are further fine-tuned according to their effectiveness. Calibur demonstrated the ability to scale well with respect to increases in the number of decoys. For a sample size of approximately 30 thousand decoys, Calibur completed the analysis in one third of the time required when the strategies are not used. For practical use Calibur is able to automatically discover from the input decoys a suitable threshold distance for clustering. Several methods for this discovery are implemented in Calibur, where by default a very fast one is used. Using the default method Calibur reported relatively good decoys in our tests. Conclusions Calibur's ability to handle very large protein decoy sets makes it a useful tool for clustering decoys in ab initio protein structure prediction. As the number of decoys generated in these methods increases, we believe Calibur will come in important for progress in the field.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ANGLOR: A Composite Machine-Learning Algorithm for Protein Backbone Torsion Angle Prediction

Author: AG de Brevern
AG de Brevern
C Branden
C Bystroff
C Mooney
C Zhang
CJC Burges
David Jones
DT Jones
H Chen
MH Zaman
MJ Wood
MV Berjanskii
NC Fitzkee
O Dor
O Zimmermann
R Karchin
R Kuang
S Haykin
S Neal
S Wu
S Wu
S Wu
SF Altschul
Sitao Wu
U Hobohm
V Vapnik
W Kabsch
Y Zhang
Y Zhang
Y Zhang
Yang Zhang
YM Huang
Publication venue: Public Library of Science
Publication date: 15/10/2008
Field of study

We developed a composite machine-learning based algorithm, called ANGLOR, to predict real-value protein backbone torsion angles from amino acid sequences. The input features of ANGLOR include sequence profiles, predicted secondary structure and solvent accessibility. In a large-scale benchmarking test, the mean absolute error (MAE) of the phi/psi prediction is 28°/46°, which is ∼10% lower than that generated by software in literature. The prediction is statistically different from a random predictor (or a purely secondary-structure-based predictor) with p-value <1.0×10−300 (or <1.0×10−148) by Wilcoxon signed rank test. For some residues (ILE, LEU, PRO and VAL) and especially the residues in helix and buried regions, the MAE of phi angles is much smaller (10–20°) than that in other environments. Thus, although the average accuracy of the ANGLOR prediction is still low, the portion of the accurately predicted dihedral angles may be useful in assisting protein fold recognition and ab initio 3D structure modeling

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

KU ScholarWorks

PubMed Central

Characterization of S3Pvac Anti-Cysticercosis Vaccine Components: Implications for the Development of an Anti-Cestodiasis Vaccine

Author: A Flisser
A Plancarte
A Toledo
A Toledo
AE Gonzalez
Aline S. de Aluja
AO Bush
B Gottstein
Beatriz Hernández
C Bystroff
C Bystroff
C Cruz-Revilla
C Garcia-Allan
C Gauci
C Larralde
C Notredame
Caris M. Nunes
David Joseph Diemert
Dunia Rassy
E Assana
E Montero
E Nascimento
E Sciutto
E Sciutto
E Sciutto
E Sciutto
Edda Sciutto
EP Hoberg
F Boudet
G Fragoso
G Rosas
G Rosas
G Rosas
Gabriela Rosas
Germano F. Biondi
Gladis Fragoso
J Mariaux
J Morales
JA Onyango-Abuje
Jacquelynne Cervantes
JD Smyth
JK Smith
JL Molinari
JS Suk
Juan P. Laclette
Julio Morales
K Manoutcharian
K Manoutcharian
K Willms
Klaus Brehm
LJ Harrison
M Huerta
M Spiliotis
MA Diaz
MA Gemmell
MA Larkin
Marisela Hernández
MD Rickard
ME Everhart
ME Patarroyo
MW Lightowlers
MW Lightowlers
MW Lightowlers
N Goldman
Nelly Villalobos
PM Schantz
R Freeman
R Segura-Velazquez
Raúl J. Bobes
RM Parkhouse
S Leon-Cabrera
Saúl Pedraza
SF Altschul
SL Chang
Victor H. Anaya
Z Yang
Publication venue: Public Library of Science
Publication date: 23/06/2010
Field of study

Background: Cysticercosis and hydatidosis seriously affect human health and are responsible for considerable economic loss in animal husbandry in non-developed and developed countries. S3Pvac and EG95 are the only field trial-tested vaccine candidates against cysticercosis and hydatidosis, respectively. S3Pvac is composed of three peptides (KETc1, GK1 and KETc12), originally identified in a Taenia crassiceps cDNA library. S3Pvac synthetically and recombinantly expressed is effective against experimentally and naturally acquired cysticercosis.Methodology/ Principal Findings: In this study, the homologous sequences of two of the S3Pvac peptides, GK1 and KETc1, were identified and further characterized in Taenia crassiceps WFU, Taenia solium, Taenia saginata, Echinococcus granulosus and Echinococcus multilocularis. Comparisons of the nucleotide and amino acid sequences coding for KETc1 and GK1 revealed significant homologies in these species. The predicted secondary structure of GK1 is almost identical between the species, while some differences were observed in the C terminal region of KETc1 according to 3D modeling. A KETc1 variant with a deletion of three C-terminal amino acids protected to the same extent against experimental murine cysticercosis as the entire peptide. on the contrary, immunization with the truncated GK1 failed to induce protection. Immunolocalization studies revealed the non stage-specificity of the two S3Pvac epitopes and their persistence in the larval tegument of all species and in Taenia adult tapeworms.Conclusions/ Significance: These results indicate that GK1 and KETc1 may be considered candidates to be included in the formulation of a multivalent and multistage vaccine against these cestodiases because of their enhancing effects on other available vaccine candidates

Public Library of Science (PLOS)

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

PubMed Central

Prediction of backbone dihedral angles and protein secondary structure using support vector machines

Author: AG de Brevern
AG Murzin
AK Jain
AP Dempster
B Oliva
B Rost
B Rost
B Rost
B Xue
BH Park
BW Matthews
C Bystroff
C Bystroff
C Mooney
CB Anfinsen
CC Chang
CW Hsu
D Frishman
D Przybylski
DT Jones
DT Jones
E Faraggi
FM Richards
G Karypis
G Pollastri
GN Ramachandran
H Kim
IH Witten
J Guo
J Kyte
J MacQueen
JA Cuff
JA Cuff
JJ Ward
Jonathan D Hirst
JR Green
K Karplus
K Lin
KY Yeung
M Ouali
MJ Rooman
MJ Wood
N Cristianini
N Qian
O Dor
O Zimmermann
O Zimmermann
Petros Kountouris
PY Chou
Q Dong
R Karchin
R Kuang
S Henikoff
S Hua
S Qin
S Wu
SC Lovell
SF Altschul
SK Riis
U Hobohm
V Vapnik
W Kabsch
XM Pan
Y Xu
YM Huang
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background The prediction of the secondary structure of a protein is a critical step in the prediction of its tertiary structure and, potentially, its function. Moreover, the backbone dihedral angles, highly correlated with secondary structures, provide crucial information about the local three-dimensional structure. Results We predict independently both the secondary structure and the backbone dihedral angles and combine the results in a loop to enhance each prediction reciprocally. Support vector machines, a state-of-the-art supervised classification technique, achieve secondary structure predictive accuracy of 80% on a non-redundant set of 513 proteins, significantly higher than other methods on the same dataset. The dihedral angle space is divided into a number of regions using two unsupervised clustering techniques in order to predict the region in which a new residue belongs. The performance of our method is comparable to, and in some cases more accurate than, other multi-class dihedral prediction methods. Conclusions We have created an accurate predictor of backbone dihedral angles and secondary structure. Our method, called DISSPred, is available online at <url>http://comp.chem.nottingham.ac.uk/disspred/</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Model based dynamics analysis in live cell microtubule images

Author: A Altinok
A Goncalves
A Janulevicius
A Krogh
A Rao
Alphan Altınok
Austin J Peck
B Alberts
BS Manjunath
C Bystroff
D Panda
Erkan Kiris
G Danuser
G Margolin
J Sethian
JC Augustinack
JM Bunker
JM Bunker
K Kamath
K Karplus
K Shafique
Kenneth Rose
L Rabiner
Leslie Wilson
LR Bahl
M Bicego
M Brand
M El-Saban
M Jiang
ML Gupta
P Maddox
R Durbin
RA Crowther
RA Walker
S Hadjidemetriou
SC Feinstein
SF Levy
Stuart C Feinstein
T Crowther
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Background: The dynamic growing and shortening behaviors of microtubules are central to the fundamental roles played by microtubules in essentially all eukaryotic cells. Traditionally, microtubule behavior is quantified by manually tracking individual microtubules in time-lapse images under various experimental conditions. Manual analysis is laborious, approximate, and often offers limited analytical capability in extracting potentially valuable information from the data. Results: In this work, we present computer vision and machine-learning based methods for extracting novel dynamics information from time-lapse images. Using actual microtubule data, we estimate statistical models of microtubule behavior that are highly effective in identifying common and distinct characteristics of microtubule dynamic behavior. Conclusion: Computational methods provide powerful analytical capabilities in addition to traditional analysis methods for studying microtubule dynamic behavior. Novel capabilities, such as building and querying microtubule image databases, are introduced to quantify and analyze microtubule dynamic behavior

Crossref

Springer - Publisher Connector

PubMed Central

OpenMETU (Middle East Technical University)

Protein structure search and local structure characterization

Author: A Andreeva
AC Camproux
AG de Brevern
AG de Brevern
AG de Brevern
AR Ortiz
B Offmann
B Rost
C Benros
C Bystroff
CA Orengo
D Baker
E Appella
F Birzele
F Guyon
G Pollastri
HM Berman
IN Shindyalo
J Garnier
J Schuchhardt
J Vesanto
JA Hartigan
JM Yang
JS Fetrow
L Holm
M Carpentier
M Dudev
M Tyagi
M Tyagi
M Tyagi
NJ Mulder
O Sander
R Unger
S Henikoff
Shih-Yen Ku
T Madej
TL Bailey
TM Mitchell
TN Petersen
U Hobohm
VS Gowri
W Humphrey
WM Zheng
WR Pearson
Y Liu
Y Ye
Yuh-Jyh Hu
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Structural similarities among proteins can provide valuable insight into their functional mechanisms and relationships. As the number of available three-dimensional (3D) protein structures increases, a greater variety of studies can be conducted with increasing efficiency, among which is the design of protein structural alphabets. Structural alphabets allow us to characterize local structures of proteins and describe the global folding structure of a protein using a one-dimensional (1D) sequence. Thus, 1D sequences can be used to identify structural similarities among proteins using standard sequence alignment tools such as BLAST or FASTA. Results We used self-organizing maps in combination with a minimum spanning tree algorithm to determine the optimum size of a structural alphabet and applied the k-means algorithm to group protein fragnts into clusters. The centroids of these clusters defined the structural alphabet. We also developed a flexible matrix training system to build a substitution matrix (TRISUM-169) for our alphabet. Based on FASTA and using TRISUM-169 as the substitution matrix, we developed the SA-FAST alignment tool. We compared the performance of SA-FAST with that of various search tools in database-scale search tasks and found that SA-FAST was highly competitive in all tests conducted. Further, we evaluated the performance of our structural alphabet in recognizing specific structural domains of EGF and EGF-like proteins. Our method successfully recovered more EGF sub-domains using our structural alphabet than when using other structural alphabets. SA-FAST can be found at <url>http://140.113.166.178/safast/</url>. Conclusion The goal of this project was two-fold. First, we wanted to introduce a modular design pipeline to those who have been working with structural alphabets. Secondly, we wanted to open the door to researchers who have done substantial work in biological sequences but have yet to enter the field of protein structure research. Our experiments showed that by transforming the structural representations from 3D to 1D, several 1D-based tools can be applied to structural analysis, including similarity searches and structural motif finding.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Improving model construction of profile HMMs for remote homology detection through structural alignment

Author: A Andreeva
A Bateman
A Krogh
A Krogh
AC Camproux
Alberto MR Dávila
B Brejova
B Knudsen
B Qian
C Bystroff
C Do
C Notredame
D Feng
D Haft
F Altschul
F Goyon
Gerson Zaverucha
H Mamitsuka
I Letunic
J Espadaler
J Gough
J Park
J Shi
J Söding
J Thompson
JD Thompson
JR Beck
Juliana S Bernardes
K Bae
K Karplus
K Karplus
K Katoh
K Lin
K Mizuguchi
K Sjolander
L Holm
L Rabiner
M Gribskov
M Helen
M Madera
M Mendel
M Wistrand
M Wistrand
O Sullivan
P Bourne
P Nuin
R Edgar
R Hughey
R Hughey
R Karchin
S Altschul
S Eddy
S Jones
T Attwood
T Mitchell
V Alexandrov
Vítor S Costa
W Majoros
W Taylor
WR Pearson
Y Hou
Y Hou
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Remote homology detection is a challenging problem in Bioinformatics. Arguably, profile Hidden Markov Models (pHMMs) are one of the most successful approaches in addressing this important problem. pHMM packages present a relatively small computational cost, and perform particularly well at recognizing remote homologies. This raises the question of whether structural alignments could impact the performance of pHMMs trained from proteins in the <it>Twilight Zone</it>, as structural alignments are often more accurate than sequence alignments at identifying motifs and functional residues. Next, we assess the impact of using structural alignments in pHMM performance. Results We used the SCOP database to perform our experiments. Structural alignments were obtained using the 3DCOFFEE and MAMMOTH-mult tools; sequence alignments were obtained using CLUSTALW, TCOFFEE, MAFFT and PROBCONS. We performed leave-one-family-out cross-validation over super-families. Performance was evaluated through ROC curves and paired two tailed t-test. Conclusion We observed that pHMMs derived from structural alignments performed significantly better than pHMMs derived from sequence alignment in low-identity regions, mainly below 20%. We believe this is because structural alignment tools are better at focusing on the important patterns that are more often conserved through evolution, resulting in higher quality pHMMs. On the other hand, sensitivity of these tools is still quite low for these low-identity regions. Our results suggest a number of possible directions for improvements in this area.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central